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Abstract 

Contact patterns in populations fundamentally influence the spread of infectious 
diseases. Current mathematical methods for epidemiological forecasting on networks 
largely assume that contacts between individuals are fixed, at least for the duration of 
an outbreak. In reality, contact patterns may be quite fluid, with individuals frequently 
making and breaking social or sexual relationships. Here we develop a mathematical 
approach to predicting disease transmission on dynamic networks in which each in- 
dividual has a characteristic behavior (typical contact number), but the identities of 
their contacts change in time. We show that dynamic contact patterns shape epi- 
demiological dynamics in ways that cannot be adequately captured in static network 
models or mass-action models. Our new model interpolates smoothly between static 
network models and mass-action models using a mixing parameter, thereby providing 
a bridge between disparate classes of epidemiological models. Using epidemiological 
and sexual contact data from an Atlanta high school, we then demonstrate the utility 
of this method for forecasting and controlling sexually transmitted disease outbreaks. 
Keywords:Infectious disease — SIR — Networks — Syphilis 



1 Introduction 

Most epidemic models incorporate a homogeneous mixing assumption, sometimes called the 
law of mass action [Ross(1910) Anderson and May (1991)] Diekmann and Heesterbeek(2000)] , 



whereby the rate of increase in epidemic incidence is proportional to the product of the 
number of infectious and the number of susceptible individuals. This assumption has 
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been relaxed in some compartmental [van den Driessche and Watmougli(2002)| and meta- 
population models [Lloyd and May(1996)[|Finkenstadt and Grenfell(1998)[[Grenfell et al.(2001)[ 



Watts et al. (2005)1 , but not eliminated. The mass action assumption is robust in the sense 



that it is consistent with several scenarios for the individual-to-individual transmission of 
disease. In particular, it is equivalent to a model in which all individuals in a population 
make contact at an identical rate and have identical probabilities of disease transmission to 
those contacts per unit time. Although this assumption is unrealistic, it facilitates mathe- 
matical analysis and, in some cases, offers a reasonable approximation. 

Populations can be quite heterogeneous with respect susceptibility, infectiousness, con- 
tact rates, or number of partners, and simple homogeneous mixing models do not al- 
low for extreme variation in host parameters. New network-based mathematical meth- 
ods capture some, but not all, aspects of population heterogeneity [Callaway et al.(2000)[ 
Strogatz (2001)1 [Newman et al.(2002")||Liljeros et al.(2001)[p\-ndersson(1998)[|Gupta et al.(1989)| 



Newman(2002)[ . Ideally, an epidemic model would incorporate the following realities of 



human-to- human contacts: 

• A given individual has contact with only a finite number of other individuals in the 
population at any one time, and contacts which can result in disease-transmission are 
usually short and repeated events. 

• The number and frequency of contacts between individuals can be very heterogeneous. 

• The numbers and identities of an individual's contacts will change as time goes by. 
The first two points have been addressed previously by static network models [Newman(2002)[ 



Eames and Keeling(2002)j [Meyers et al. (2005b) [ [Meyers et al.(2006)[ . We will focus on the 



third point, and introduce a modeling framework that allows an individual's contacts to 
change in time. 

Concurrent and serial contacts were first shown to be important to HIV transmis- 
sion dynamics in Watts and May(1992)[ Dietz(1988)| , and have been modeled using high- 



dimensional pair-approximation methods [Altmann(1995)[ Altmann(1998)[[Eames and Keeling(2004)| 



and moment closure methods on dynamic contact networks Bauch(2002)| . Here, we intro- 
duce a low-dimensional system of non-linear ODEs to model susceptible-infected-recovered 
(SIR) epidemics in a simple class of dynamic networks. We use this model to character- 
ize the impacts of population heterogeneity and contact rates on epidemic dynamics. We 
show further that the model reproduces basic classes of epidemiological models, such as the 
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standard mass-action SIR model and static network model, as the parameter which controls 
population mixing varies. 

2 Neighbor-Exchange model 

Human contact patterns are potentially complex, as the numbers and intensity of contacts 
can vary considerably across a population. Furthermore, contacts are transitory events 
such that the identities of one's contacts change in time. To capture such heterogeneity, 
we introduce the neighbor-exchange (NE) model as a simple extension of a static contact 
network model. In this model, an individual's number of concurrent contacts remains fixed 
while the composition of those contacts changes at a specified rate. The model assumes 
that at any given time, an individual will be in contact with an individual-specific number 
of neighbors with whom disease transmission is possible. Each contact is temporary, lasting 
a variable amount of time before coming to end, at which point the neighbor is replaced by 
a different individual. 

Let the population of interest consist of n individuals, each of which falls into one of three 
exclusive states: susceptible, infectious, or recovered. At some time t, an individual ego will 
have kego contacts with other individuals (i.e., alters): {ego, alteri), {ego, alter2), ■ ■■ , {ego, alterk). 
Only undirected contact networks will be considered such that if there exists a contact 
{ego, alter) there will also be a contact {alter, ego) . In network terminology a directed link, 
denoted {ego, alter), is called an arc. An undirected link, denoted {ego, alter}, is called an 
edge. The degree of a node ego is the number of edges connected to the node. The term 
contact will specifically be used to denote a directed arc in the network, where two arcs cor- 
respond to each undirected edge. The k-degree of a node will be the number of concurrent 
contacts to/from the node. 

The neighbor exchange model assumes that the identities of a node's neighbors will 
continually change while the total number of current neighbors remains constant. This 
occurs through an exchange mechanism in which the destination nodes of two edges are 
swapped. For example, two nodes ego and ego' with distinct contacts {ego, alter) and 
{ego' , alter') may exchange contacts such that these are replaced with {ego, alter') and 
{ego' , alter). There are two edges and four contacts involved in each edge-swap. The fate 
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Table 1: Notation for epidemic and network parameters. 



s 


The fraction of population susceptible. 


I 


The fraction of population infectious. 


R 


The fraction of population recovered. 


S 


Set of susceptible nodes. 


I 


Set of infectious nodes. 


n 


Set of recovered nodes. 


J 


Cumulative epidemic incidence {J = I + R) 


r 


Transmission rate. 




Recovery rate. 


P 


Mixing rate. 




Degree of node ego. 


Pk 


The fraction of nodes with degree k. 



of each edge and contact is summarized in the following pseudo-chemical equation: 

{ego, alter} + {ego', alter'} — > {ego, alter'} + {ego' , alter} 
{ego, alter) + {ego' , alter') — > {ego, alter') + {ego' , alter) (1) 
{alter, ego) + {alter' , ego') — > {alter, ego') + {alter' , ego) 

In the model, any given contact {ego, alter) will be reassigned to {ego, alter') at a constant 
rate p. Equivalently, edges are swapped at a rate 2p. 

For mathematical tractability, we make a few simplifying assumptions about the epidemic 
process. First, for the duration of a contact, infectious individuals transmit disease to 
neighbors at a constant rate r. Second, infectious individuals become recovered at a constant 
rate /i. 

We also simplify the mathematics by considering only the simplest category of hetero- 
geneous networks: semi-random networks which are random with respect to a given degree 
distribution MoUoy and Reed(1998) Molloy and Reed(1995)| . The degree distribution will 
have density p^, which is the probability that a node chosen uniformly at random has k 
concurrent contacts. Note that a node's degree never changes during an edge-swap and thus 
the degree distribution is preserved. 

The simultaneous epidemic and network dynamics described above collectively determine 
the neighbor-exchange model. An even more realistic model would allow the number k of 
concurrent contacts of a node to vary stochastically, but the current model offers a valuable 
first step toward understanding epidemiological processes on dynamic host networks. 
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Table 2: Notation for network and epidemic quantities. 

A^x Set of contacts {ego, alter) s.t. ego £ X. 

A.XY Set of contacts (ego, alter) s.t. ego £ X and alter £ Y. 

Mx Fraction of contacts in set A.x- 

MxY Fraction of contacts in set AxY- 

pi = Msx/Mg Fraction of contacts from susceptibles which go to infectious nodes. 

PS = Mss/Ms Fraction of contacts from susceptibles which go to other susceptible nodes. 

2.1 Dynamics 

We will expand on the dynamic probability generating function (PGF) techniques first in- 
troduced in |Volz(2007a)| to model SIR-type epidemics in static networks. These techniques 
are powerful and are easily extended to consider dynamic contact networks. We start with 
an overview of the basic model and then introduce additional terms that model the neighbor 
exchange process. 

The concurrent degree distribution pk will be generated by the PGF 

g{x) = pq + pix + p2X^ + Pzx-^ ^ (2) 

The dummy variable x in this equation serves as a placeholder for dynamic variables. 

Let Ax be the sets of contacts (arcs) where ego is in the set X. We will consider the 
sets of all susceptible, infected and recovered nodes, denoted X — S , X = J , and X ^ TZ, 
respectively. Mx — #{.4x}/#{.4} is then defined as the fraction of total contacts in set 
Ax- Now let AxY be the set of contacts such that ego is the set X and alter is in the set 
Y, and Mxy — ^{Axv} /#{A} be the fraction of total contacts in set Axv- For example, 
Mss is the fraction of all arcs in the network that connect two currently susceptible nodes. 

The following derivation assumes that each contact of a susceptible node (ego, alter) £ 
As has a uniform probability that alter G 2, denoted pi — Msi/Mg, and a uniform 
probability ps = Mss/AIs that alter £ S. A degree k susceptible node has an expected 
number kpj contacts with infectious nodes; and, in a small time dt, an expected number 
rkpj dt of a degree k susceptible nodes' contacts will transmit disease to that node. The 
instantaneous hazard of infection for a degree k susceptible node is then given by 

Afe(t) = rkpiit) (3) 

Let Uk{t) denote the fraction of degree k nodes remaining susceptible at time t. Equation 
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[3] impl: 



ICS 



-Afe(T)dr} 



Uk{t) = exp{ J 
= exp{ J 

= exp{ / -rpi{t)dT} 

Jt=0 



=0 

-rkpi{t)dT} (4) 



r=0 

-rpi{t)dT^'' 

'r=0 

Now let = be the fraction of degree k = I nodes in the network which remain 

susceptible at time t. Equation [l] implies that Uk = 0^ ■ (Henceforth, variables that are 
clearly dynamic, like 6, appear without a time (t) variable.) 

We use the PGF for the network degree distribution (equation [2| to calculate the fraction 
of nodes that remain susceptible at time t. 

S = pa + uipi + U2P2 H = ^PkO'' = g{9). (5) 

k 

The dynamics of 9 in equation [5] are given by 

9 = ^u, = -X^e - -rpj9. (6) 
at 

Unfortunately, this does not form a closed system of differential equations, as equation [6] 
depends on the dynamic variable pj. From the definition of pj we have 



d Msi _ Msi MsiM s 
dt Ms ~ Ms Ml 



To obtain the derivatives of Mgj and M5, we observe that, in time dt, —S nodes become 
infectious, resulting in modifications to the sets Asi, AjSj and As- In particular, any given 
arc from newly infected individual was formerly in one of ^455, Asi, or Astz, smd is now 
in one of Ajs, Ajj, or A-m- The ~S new infecteds are not selected uniformly at random 
from the susceptible population, but rather with probability proportional to the number of 
contacts to infectious nodes. 

We pause for a few definitions. First, it is useful to break the degree of a node into 
three quantities: the number of contacts to currently susceptible, infected, and recovered 
nodes. We refer to these as X-degrees, where X — S, I, or R, respectively. Second, imagine 
following a randomly chosen contact to its alter node and counting all the edges emanating 
from that node, except for the one on which we arrived. We call the resulting total quantity 
the excess degree of the node, and the resulting neighbor-specific quantities excess X-degrees 
of the node, where X indicates one of the three possible disease states. 
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Wc introduce the notation 5x.y{Z) to represent the average excess Z-degree of nodes 
currently in disease state X selected by following a randomly chosen arc from the set Ayx^ in 
other words, the excess Z-degree of a node of type X selected with probability proportional 
to the number of contacts to nodes of type Y . We further define 5x,y to be the average 
(total) excess degree of nodes currently in disease state X selected by following a randomly 
chosen arc from the set Ayx- For example, imagine first randomly choosing an arc from 
Axs^ then following that arc to its destination (susceptible) node, and finally counting all 
of the other edges emanating from that node (ignoring the one along which we arrived). 
Then 5s,i{S), 6sj{I), and 5s.i{R) give the average number of contacts to other susceptible, 
infected and recovered nodes chosen in this way, respectively; and ^5./ gives the average 
total number of contacts emanating from nodes chosen in this way. 

Using this notation, the equations for and are as follows (for more details, 

see |Volz (2007a)] ). 

S - j^9{9) - eg'{9) = ~rpieg\e) (8) 
Msi = {{-S)5sj{S) - {-S)6sj{I))/g'{l) - {r + fi)Msx (9) 

Mss = ~2{~S)ds,i{S)/g'{l) (10) 
Ms = j^9g'{0)/g'{l) - ~{rpj0g\e) + rpj0^g"{0))/g'{l) (U) 

The calculations of the 5x,y{Z) are straightforward, and based on the current degree distri- 



bution of susceptible nodes. The calculations are given in the supplement Electronic Supplement: http://?| 
and in |Volz (2007a) | , and result in the following. 

5sAl)^PiO9"{O)/g'{0) (12) 
5sj{S)=PsOg"{0)lg'{0) (13) 

Combining the equations for Mg , Ms , and Mgi yields the dynamics of pi in terms of 
the parameters r and /i, the PGF .g(-), and the dynamic variable pg. The resulting model is 
given in table |3] The dynamics for ps complete the model, and can be derived analogously 



to the equation for pi (see the supplement Electronic Supplement: http://?], for details) 



2.1.1 Dynamic contact networks 

We now extend the model given in table |3] to allow neighbor exchanges (NE) at a rate p. 
First consider 0. An edge-swap (equation [ij will affect the arrangement of edges among 
susceptible, infectious, or recovered nodes, however it will never directly cause a node to 



7 



Table 3: System of equations used to model the spread of an SIR epidemic in a static 
semi-random network. 



e = 


—rpi0 


PS = 


rpsPi (1 - Og"{0)/9'(e)) 


Pi = 


rpipseg"{e)/g'{e) - pj(l - pi)r - pjti 


s = 





change its disease state. The dynamics of 6 are only indirectly influenced by NE dynamics, 
through changes to pi, and thus equation [6] still holds. 

NE does, however, affect the composition of contacts between susceptible and infectious 
nodes. We postulate that the equations for pi and ps can be expressed in the following 
modified forms. 

PI = rpiPs0g"{e)/9'{0) - Pi{l - Pi)r - Pill + fiipi, Mi) (14) 
PS - rpspi [l - + /s(Ps, Ms) (15) 

where the functions fx{',') represent the contribution of NE dynamics to the system. 

NE dynamics can both decrease or increase the values of pi and ps- First consider the 
decrease of pi due to NE dynamics. 

• At rate p, a given contact (ego, alter) will transform to (ego, alter'). 

• Given that (ego, alter) e ^5, alter G X with probability pi. 

• With probability 1- Mi, alter' ^ I. 

Thus Pi will be decreased by NE dynamics at a rate ppi(l — Mi). 
Similarly, pi can increase due to NE as follows. 

• At rate p, a given contact (ego, alter) will transform to (ego, alter'). 

• Given that (ego, alter) € As, alter ^ X with probability 1 — pi. 

• With probability Mi, alter' e J. 

Thus Pi is increased by NE dynamics at rate p(l —pi)Mi. We add the positive and negative 
contributions to calculate the total influence of NE on pi. 

fi(pi,Mi)=p(Mi~pi) (16) 
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Table 4: System of equations used to model the spread of an SIR type epidemic in a dynamic 
semi-random network with stochastic exchange of neighbors at constant rate p. 

e = -rpi9 

PS = rpspi (1 - eg"{e)/g'{0)) + p {g'ie)/g'{l) - ps) 

Pi = rpips8g"{0)/g'{e) - pj(l - p/)r - pifi + p (A/j - pi) 

Mj = -p.Mi + rpi {e^g"(e) + Og' {9)) /g'jl) 

S = g{e) 



By similar reasoning, we determine fs{psi Mj). 

fs{ps,Ms) ^ p{Pr[alter i S] x Pr[alter' e S] - Pr[alter £ S] x Pr[alter' ^ S]) 

(17) 

= p {Ms ~Ps)^P {9'{0)/g'{l) - ps) 

The complete system of NE-adjusted equations is reported in table [4] The dynamic 
variable Mj, however, appears in equation [17] and cannot be put in terms of the variables 
0,ps, and pj. Therefore a fourth dynamical equation must be included for Mj, which is 
listed in table |4] Fortunately, the dynamics are very straightforward. Recall that Ssj 
denotes the average excess degree of susceptibles chosen by following randomly selected arcs 
between infectious and susceptible nodes, and thus indicates the average excess degree for 
individuals who become infected in a small time interval dt. We add one {Ssj + 1) to obtain 
the average degree for such newly infected nodes. Then, recalling that Mj decays at rate p, 
we have 

MI^i~S){SsJ + l)^^^MJ (18) 

All of the results that follow will assume that a very small fraction e of nodes are initially 
infected, and thus there is only a very small probability that two initially infected individuals 
contact each other. Then we anticipate the following initial conditions |Volz(2007a)| : 

= Mi = e (19) 
PI = e/(l - e) (20) 
ps = {l-2e)/{l-e) (21) 

2.2 Convergence to a mass action model 

In the limit of large mixing rate (p — )■ oo) , the probability of being connected to a susceptible, 
infectious, or recovered node is directly proportional to the number of edges emanating from 
nodes in each state. Referring to table |4] it is clear that pj converges instantly to AIj and 
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Ps converges instantly to Ms- Here, we show that, as the mixing rate grows, the underlying 
network structure becomes irrelevant and the model converges to a mass action model. 

To see this, we replace every occurrence of the variable pj in the system of equations in 
table g] with Af/. Then 

e = -rMiO, (22) 

and, 

Mi - ^ (eg'ie) + e''g"ie)) - f,Mj (23) 

Neither equations [22|or[23]depend on ps, and thus together form a closed system of equations 
which describe the epidemic dynamics. These equations incorporate arbitrary heterogeneity 
in contact rates, but no longer consider an explicit contact network. When we assume 
that contact rates are homogeneous throughout the population, than these equations are 
equivalent to a simple SIR compartmental model. To illustrate, we retrieve the standard 
SIR dynamics by setting g(x) = x, which means that every node has exactly one concurrent 
contact. In such a population, the number of arcs to infectious individuals is exactly equal 
to the number of infectious nodes, that is, Mj = /. Then, substituting into equations |22| 
and[23l we reproduce the standard equations: 

S = g{e) = e (24) 
e = -rie = -rIS (25) 
Ml = rMiO - fiMi = rIS - fil (26) 

Equations [22] and [23] are potentially extremely useful, as they incorporate arbitrary 
heterogeneity in a system of equations no more complex than the standard compartmental 
SIR model. 

Figure [l]demonstrates the convergence of the NE model to the corresponding mass-action 
model for a Poisson degree distribution {r — fi — 0.2, z — 1.5). The circles indicate the 



solution to the mass action model (equations 22 and 23 1 . We observe that the convergence 



is quite rapid as p is increased in multiples of /i. This supports the common assumption 
that the mass-action model is a reasonable approximation for populations marked by many 
short-duration contacts. 

3 Stochastic simulations 

To test the NE model, we compare its predictions to stochastic simulations of an analogous 
epidemic process in networks. We first generate semi-random networks using the configura- 
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Time (t) 



Figure 1: Trajectories of the NE model (solid lines, table [4]) are compared for several values 



of the mixing rate (p) to an analogous mass-action model (circles, equations 22 and 23 1 
The degree distribution is Poisson (z = 1.5) and r = /i = 0.2. 



tion model Molloy and Reed(1995)| . The epidemic simulations then proceed as follows: 



1. One node is selected uniformly at random from the population to be patient zero, the 
first infected individual. 

2. Each contact a Cz A has an exchange time AtE{a) drawn from an exponential distri- 
bution (parameter p). This time is added to a queue. 

3. When a node v is infected at time t, a time of infection Atj{v,a) is drawn from an 
exponential distribution (parameter r) for each contact a = {ego, alter) such that v is 
ego. The time t + Atj(v, a) is added to a queue. 

4. When a node v is infected at time t, a time of recovery Atri{v) is drawn from an 
exponential distribution (parameter fj,) and is assigned to v. The time t + Atji{v) is 
added to a queue. 

5. When t' — t + AtE^a) is the earliest time in the queue, an edge-swap is performed, as 
per equation [l] The first edge involved in the swap corresponds to the contact a. The 
second edge is selected by choosing a unique element out of the of all edges uniformly 
at random. Then a new time t' + AtE^a) is drawn and added to the queue. 

6. When t' = t + Ati{v,a) is the earliest time in the queue, a transmission event will 
occur, providing v has not recovered. Node v transmits to whatever node is occupying 
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the position of alter at that time, causing alter to change state to X if currently 
susceptible. If a transmission event occurs, a new time t' + Ati(v,a) is drawn and 
added to the queue. 

7. When t + Atfi(v) is the earliest time in the queue, the corresponding node v enters 
a recovered state such that any transmission event with v — ego is removed from the 
queue. 

This process continues until there are no more transmission events in the queue. 

Figures [2] and [3] show a comparison of one thousand stochastic simulations to the solution 
of the NE-SIR equations for two concurrent degree distributions: 

• Poisson. pk = z^e^^ /k\, fc > 0. This is generated by g{x) — e^^^~^\ 

• Power law with cutoff, — ^^"/X]i=i* k > l,k < k. This is generated by 

Figure |2] depicts epidemics on a network with a Poisson degree distribution (z = 1.5) and 
parameters r — 0.2, /i — 0.1, and p — 0.25. Figure [3] shows epidemics on a network with 
a power law degree distribution (a = 2.1, k = 75) and parameters r 0.2, /i ~ 0.1, and 
p — 0.20. The deterministic NE model (table |4| predicts a trajectory which passes through 
the central-most region of the swarm of simulation trajectories and shows good agreement 
with the final size. There is nevertheless a great deal of variability among the simulation 
trajectories in terms of the onset of the expansion phase-the point in time when the epidemic 
increases at its maximal rate. At the onset of expansion phase, all trajectories are more or 
less similar, in agreement with the NE model. 

In contrast to the homogeneous Poisson network, the power law gives an almost imme- 
diate expansion phase. This can be understood by noting that the hazard of infection is 
proportional to pi, and initially 

There is a ratio of PGF's in equation [27} 

5"(1 - e)/g'{l -e)^ g"{l)/g'{l) = - 1 

This is approximately the ratio of the second moment to the first moment of the degree distri- 
bution, which for the power-law approaches infinity as the cutoff k — > 00. Because the ratio is 
very large, power-law networks have almost immediate expansion phase. [Barthelemy et al.(2005)" 



Pastor-Satorras and Vespignani(2001b)[ Pastor-Satorras and Vespignani(2001c)[ Boguna et al.(2003)] 
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Figure 2: 1000 stochastic simulations (small dots) are compared to the predicted trajectory 
of a NE epidemic (large dots) in a Poisson network {z = 1.5).r = 0.2, fj, = 0.1, and p = 0.25. 



9000 




100 



Time (t) 

Figure 3: 1000 stochastic simulations (small dots) are compared to the predicted trajectory 
of a NE epidemic (large dots) in a power-law network (a = 2.1, k = 75). r = 0.2, fi = 0.1, 
and p = 0.20. 
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4 Application of the NE model to syphilis outbreak 
among Atlanta adolescents 

We demonstrate the utihty of the NE model using link-tracing data from a 1996 outbreak 
of syphilis among Atlanta adolescents. Sexual network data typically report contact time, 
duration of a contact, and frequency of interaction. Prior network models have typically not 
taken into account the serial aspect of sexual contacts, and instead assume that all contacts 
reported in a survey are constant over the duration of an epidemic or infectious period. 
Here, we illustrate that the dynamic SIR network model (table |4]) can explicitly capture the 
transitory nature of sexual contact patterns. 

We use public-health data from an outbreak of syphilis within an adolescent community 
centered on an Atlanta high-school Rothenberg et al.(1998)| . Initially, several adolescents 



diagnosed with syphilis were interviewed by epidemiologists. The sexual contacts of these 
respondents were then traced and interviewed. In all, 34 people were interviewed and 204 
contacts were traced. Each interviewee named their sexual contacts and listed the date of 
their first and last interaction with each contact. 

The complexity of syphilis transmission dynamics Garnett et al.(1997)| and the small 



size of our data set make modeling the 1996 outbreak quite difficult. The following results 
should thus be taken with the caveat that there is significant uncertainty in the estimated 
rates and parameters, particularly the host transmission and recovery rates. Below we will 
show the impact of these parameters on the expected final size of a syphilis epidemic. 



We estimate the relevant parameters using equations given in the supplement [Electronic Supplement: http:// 



and in |Heckathorn(2002)[ [Salganik and Heckathorn(2004)[ |Volz and Heckathorn(2007b)| . 



In brief, a typical syphilis infection can last about a year if left untreated, and a typi- 
cal infectious period will last 154 days on average |Jones(2005)| . A convenient estimate 
of the recovery rate is then /t — 1/154 — 0.0065. This estimate ignores many features 
of the pathology of syphilis for mathematical convenience, such as different probabilities 
of recovery at different stages of the infectious period Garnett et al.(1997)| . There are 



diverse estimates for the transmissibility of syphilis, ranging from 9.2% to 63% per part- 
ner Garnett et al.(1997)| . The estimate of 62.7% was selected as the most credible by the 



authors in Garnett et al.(1997)| . 



Using the contact durations reported in the Atlanta study, we estimate the mixing 
rate of the population to be p = 0.032 Electronic Supplement: http://?| . We then use 



the reported numbers of contacts to estimate fcggo, the average number of concurrent con 
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Figure 4: The final epidemic size as predicted by the NE model (table [4]) is shown with 
respect to the transmission rate r and recovery rate /i for the Atlanta syphilis data. Lighter 
colors correspond to larger final size, as given by the color bar on right. The thick black 
line corresponds to the ratio r/ji that gives the expected transmissibility oi t — 0.627. The 
large data point indicates the expected recovery rate of /i = 1/154. 



tacts for each individual ego in the sample. The degree distribution pk can then be esti- 
mated jElectroni£^Suppleme Salganik and Heckathorn(2004)[ 



Volz and Heckathorn(2007b)| from the sequence ki, which is well fit by a power law with 



exponent a — —2.66. For the power-law fit, x^/n = 0.018. An exponential distribution 
provided a worse fit to the data with /n — 0.632. We assume the estimated degree 
distribution in the following analysis. 

Figure |4] shows the final size of an outbreak as predicted by the NE model over a broad 
range transmission and recovery rates. The black line (upper left) corresponds to the ratio 
of r to that yields a transmissibility r = 0.627. If we suppose a transmissibility of 62.7% 
and a recovery rate of 1/154 — 0.0065 (the white circle in figure |4]), the final size is expected 
to be 97%. This is not consistent with the observed outbreak or an estimated prevalence of 
35% [Electronic Supplement:"|http://?[ [Heckathorn(2002)[ [Salganik and Heckathorn(2004)| 



Volz and Heckathorn(2007b)| . As figure |4] shows, however, slight alterations in r or can 



drastically impact the predicted outbreak size. In particular, therapeutic intervention, which 
certainly occurred during the 1996 outbreak, increases the effective recovery rate and thereby 
decreases the ultimate attack rate. Therefore the estimate of 97% should be considered a 
worst-case scenario given the behavioral parameters (mixing rate and degree distribution) 
estimated for this adolescent population. 
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5 Discussion 



Human contact patterns are characterized by heterogeneous numbers of transitory contacts. 
If contacts change at a rate which is slow relative the rate of epidemic propagation, then 
static network approximations such as those based on bond-percolation Callaway et al.(2000)[ 
Newman(2002)[ [Meyers et al. (2005b) [ [Meyers et al.(2006)] may be appropriate. On the 



other hand, if contacts have very short duration relative to epidemic dynamics, then static 
network approximations break down and a mass-action model is more appropriate (equa- 
tions 22 and 23 1. In between these extremes, contacts are neither fixed nor instantaneous. 



and accurate epidemiological forecasting requires models that explicitly capture their dy- 
namics, such as the NE model developed here. In fact, by changing a single parameter 
(the mixing rate), the NE model crosses the spectrum of models from static network to 
mass-action. 

The NE model is particularly useful for building models from link-tracing data. For 
many data sets, standard mass action models do not adequately capture the finite number 
and extended-duration of contacts, while static network models ignore the transitory and 
serial nature of contacts. Using the example of a 1996 syphilis outbreak in an adolescent 
population, we showed that the NE model can be easily fit to sexual contact data and then 
used to explore the epidemiological implications of host population structure. 

In the limit of large mixing rate, the NE model becomes a simple (low-dimensional) 



mass-action model (equations 22 and 23 1 that captures SIR dynamics in populations with 
arbitrary heterogeneity of contact rates. It reduces to the standard mass-action model when 
one assumes that all individuals have the same number of contacts. The mass-action model 
could potentially find wide utility in populations which are heterogeneous with respect to 
contact rates, infectiousness or susceptibility, specifically for modeling highly contagious dis- 
eases (such that brief contacts lead to transmission) or slow-propagating infectious diseases 
(such as many STD's) - In either case, epidemic propagation is slow relative to the turnover 
in contacts. 

We check our mathematical results using simulations which model continuous-time stochas- 
tic processes (both social and epidemiological) and take into account the finite size and 
heterogeneity of the population (section |3|. We wish to highlight our specific simula- 
tion techniques as an interesting alternative to the commonly used chain-binomial simula- 
Daley et al.(2001)[ . 



tion 



The NE model offers a flexible starting point for analyzing epidemiological processes in 
dynamic networks. It should be fairly straightforward to extend the model to populations 
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with simple spatial heterogeneity or assortative mixing by type [Newman(2003)| . Modeling 
a dynamic population in which the number of concurrent contacts varies in time, however, 
will likely require a different approach. 



The authors thank Richard Rothenherg for providing the Atlanta dataset and useful com- 
ments. LAM acknowledges grant support from the James S. McDonnell Foundation. 
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